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Abstract 

This paper extends the classical consumption and portfolio rules 
model in continuous time (Merton 1969, 1971) to the framework of 
decision-makers with time-inconsistent preferences. The model is sol- 
ved for different utility functions for both, naive and sophisticated 
agents, and the results are compared. In order to solve the problem 
for sophisticated agents, we derive a modified HJB (Hamilton- Jacob i- 
Bellman) equation. It is illustrated how for CRRA functions within 
the family of KARA functions (logarithmic and potential cases) the 
optimal portfolio rule does not depend on the discount rate, but this 
is not the case for a general utility function, such as the exponential 
(CARA) utility function. 
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1 Introduction 



Variable rate of time preferences have received considerable attention in re- 
cent years. Virtually every experimental study on time preferences suggests 
that the standard assumption of time-consistency (related to the assumption 
of constant discount rate of time preference) is unrealistic (see, for instance, 
Thaler (1981), Ainslie (1992) or Loewenstein and Prelec (1992)). In fact, 
there is substantial evidence that agents are impatient about choices in the 
short term but are patient when choosing between long-term alternatives. 

Effects of the so called hyperbolic discount functions, introduced by Phelps 
and PoUak (1968), have been extensively studied in a discrete time context, 
within the field of behavioral economics (for a recent application in the eco- 
nomics of information systems, see Tomak and Keskin (2008)). Laibson 
(1997) has made compelling observations about ways in which rates of time 
preference vary. However, this topic has received less attention in a continu- 
ous time setting. The main reason for this may be the complexity involved in 
the search for solutions in closed form in the non-constant discounting case. 
In fact, standard optimal control techniques cannot be used in this context, 
since they give rise to non-consistent policies. 

The most relevant effect of non-constant discounting is that preferences 
change with time. An agent making a decision in time t has different prefer- 
ences compared with those at time t'. Therefore, we can consider him or her 
at different times as different agents. An agent making a decision at time t 
is usually called the agent. If the planning horizon is a finite interval [0, T], 
we can understand the dynamic optimization problem with non-constant dis- 
counting as a perfect information sequential game with a continuous number 
of players (the t-agents, for t G [0,T]) making their decisions sequentially. A 
i-agent can act in two different ways: naive and sophisticated. 

Naive agents take decisions without taking into account that their pref- 
erences will change in the near future. Then, they will be continuously mod- 
ifying their calculated choices for the future, and their decisions will be in 
general time-inconsistent. In order to obtain a time consistent strategy, the 
t-agent should be sophisticated, in the sense of taking into account the pref- 
erences of all the t'-agents, for t' G {t,T]. Therefore, the solution to the 
problem of the agent with non-constant discounting should be constructed 
by looking for the subgame perfect equilibria of the associated game with an 
infinite number of t-agents. 

Historically, in that part of his analysis allowing for time preference, Ram- 
sey (1928) assumed an exponential discount factor with constant discount 
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rate, stating: "This is the only assumption we can make, without contra- 
dicting our fundamental hypothesis that successive generations are activated 
by the same system of preferences". The main property of non-constant 
discounting is implicit in this statement: it can create a time-consistency 
problem. In fact, Strotz (1956) illustrated how, for a very simple model, 
preferences are time consistent if, and only if, the discount factor represent- 
ing time preferences is an exponential with a constant discount rate. In order 
to avoid such time inconsistency, agents could decide in a sophisticated way, 
making an analysis of what their actions would be in the future, as a con- 
sequence of their changing preferences. For instance, Pollak (1968) gave the 
right solution to the Strotz problem for both naive and sophisticated agents 
under a logarithmic utility function. 

Although the problem was first presented in a continuous time context 
(Strotz (1956)), almost all attention has been given to the discrete time set- 
ting introduced by Phelps and Pollak (1968). This is probably a consequence 
of the non-existence of a well-stated system of equations giving a general 
method for solving the problem, at least for sophisticated agents. Therefore, 
each particular problem has been solved individually. This was the case of 
the Strotz model solved by Pollak in 1968. Barro (1999) studied a modified 
version of the neoclassical growth model by including a variable rate of time 
preference. 

For the case of naive agents, one should solve a standard optimal control 
problem for each time t e [0,r], in order to find the decision rule at time 
i of a i-agent. Unfortunately, this method cannot be used if the agent is 
sophisticated. Instead, Markov subgame perfect equilibria must be found. 
This prompts the use of a dynamic programming approach, applying the 
Bellman optimality principle. 

To solve the intra-personal game for sophisticated agents, a continuous- 
time model of quasi-hyperbolic time preferences was introduced in Harris and 
Laibson (2008). In their model, the discount rate declines during the first "pe- 
riod" (instantaneous gratification), and then becomes constant. Grenadier 
and Wang (2007) employed this model to extend the real options frame- 
work in order to analyze the investment-timing decisions (in an irreversible 
investment framework) of entrepreneurs with time-inconsistent preferences. 

In a deterministic environment, Karp (2007) adapted the approach by 
Harris and Laibson (2008) for the general case of an arbitrary discount rate 
of time preference, for autonomous infinite time horizon problems. The free 
terminal time case in non-autonomous problems in finite horizon was ana- 
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lyzed in Marm-Solano and Navas (2009). An alternative approach to the 
problem was given in Ekeland and Lazrak (2008). 

In this paper we extend the results by Karp (2007) and Marin-Solano 
and Navas (2009) to a stochastic environment, in order to analyze how time- 
inconsistent preferences modify the classical optimal consumption and port- 
foho rules when the discount rate is constant (Merton (1969), (1971)). As 
expected, the rate of time preference plays no role in a pure optimal portfo- 
lio management problem. However, if the consumption is introduced in the 
model, an inter-temporal conflict arises. We show that, within the HARA 
(hyperbolic absolute risk aversion) functions, if the relative risk aversion is 
constant (logarithmic and potential utility functions), the optimal portfolio 
rule does not depend on the rate of time preference, although the consump- 
tion rule changes. This nice property is not satisfied for more general utility 
functions, such as the (constant absolute risk aversion) exponential function. 

The paper is organized as follows. In Section 2 we describe the model. 
The general stochastic dynamic optimization problem with non-constant dis- 
count rate of time preference is studied in Section 3, and the dynamic pro- 
gramming (Hamilton- Jacobi-Bellman) equation is derived. In Section 4, this 
equation is solved for the "optimal" (in fact, equilibrium) consumption and 
portfoho rules problem for some particular utility functions. The so-called 
pre-commitment, naive and sophisticated solutions are compared. Finally, 
Section 5 contains the main conclusions of the paper. 



2 The Model 

Let X = (x^, . . . , x'^) e X be the vector of state variables, u = (m^, . . . , u^) e 
U the vector of control (or decision) variables, C{x{s),u{s), s) the instanta- 
neous utility function at time s, and F{x{T)) the final (bequest) function. 
In the conventional model, agent preferences at time t take the form 



e-''(^-*)£(x(s), ii(s), s) ds + e-''MF(x(T)) 



(1) 



where the state variables evolve according to the diffusion equations 

L 

dx\s) ^ f{x{s),u{s),s)ds + Y.^\{x{s),u{s),s)dw\s) , x\t)^x'^ , (2) 



for i = 1, . . . , n, where {w^{s), . . . , w^{s)) is an L-dimensional Wiener process 
with independent components {dw\s)dw^' {s) — 0, for I ^ V). We will denote 
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S = (o";), for i = 1, . . .n, I = 1, . . . , L. In order to maximize Ut, we must 
solve a stochastic optimal control problem, and since the discount rate is 
constant, the solution becomes time consistent. 

Now, following Karp (2007), let us assume that the instantaneous discount 
rate is non-constant, but a function of time r(s), for s G [0,T]. Impatient 
agents will be characterized by a non- increasing discount rate r{s). The 
discount factor at time t used to evaluate a payoff at time t + r, r > 0, is 
9{r) = exp r (s) dsy Then, the objective of the agent at time t (the 

t-agent) will be 

cT 



max E 

Ms)} 



e{s - t)C{x{s),u{s), s) ds + e{T - t)F{x{T)) 



(3) 



In Problem ([2H3]), we assume the usual regularity conditions, i.e., functions 
£, F, and a] are continuously differentiable in all their arguments. 

In the discrete time case, most papers work with the so-called hyperbolic 
discounting, first proposed by Phelps and Pollak (1968). The utility function 
is defined as Ut = Ut + l3{6ut+i + 6'^Ut+2 + ^^Ut+s + ■■■), where < (3 < 1, 
and Uk denotes the utility in period k. In fact, Laibson (1997) argues that /3 
would be substantially less than one on an annual basis, perhaps between one- 
half and two-thirds. Harris and Laibson (2008) adapted this inter-temporal 
utility function to the continuous time setting. As a natural extension of the 
above discount function to the continuous setting, Barro (1999) suggested 
the instantaneous discount rate r(r) = p + be~'^'^ , where b > and 7 > 
(for the general case, he defined the discount factor as 9{t) = e^''^^^'^'^'^^]). In 
other applications it is natural to assume that the discount factor is a linear 
combination of exponentials with constant but different discount rates. We 
will not assume any particular discount function. 

In this paper we are interested in the optimal consumption and portfolio 
rules in continuous time studied by Merton (1969, 1971). Let us assume 
that there are m risky assets and one risk-free asset. The risk-free asset pays 
a constant rate fiQ, while the return of i-th risky asset follows a geometric 
Brownian motion 

dPi = piPids + aiPidzi , i = 1, . . . , m , 

with dzidzj = Pijdt for i, j = 1, . . . m, i j, and pi, ai are constants. 

If Wi is the share of wealth invested in the z-th risky asset, and c denotes 
the consumption, the consumer's budget equation is 



dW 



^Wi{pi - po)W + {poW 



i=l 



ds + ^ WiaiWdzj , (4) 



i=l 
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with the initial condition Wq. Then the consumer-investor's problem is 



max E 

{c,Wj} 



e{s - t)u{cs) ds + e{T - t)F{x{T)) 



(5) 



s.t. (jl]) with the initial condition W{t) = Wf 

3 Dynamic Programming Equation 

For the solution of Problem ([2H3]) (and, in particular, Problem (jHE])), if 
the agent is naive, then we can adapt the standard techniques of stochastic 
optimal control theory as follows. If = V^{x, s) is the value function, the 
0-agent will solve the standard Hamilton- Jacobi-Bellman equation 



r(s)y° - K° = max !^C + V^'-f + hr (SSVi^.) | 



x(0) = xo 



i.e., the naive agent at time solves the problem, assuming that the discount 
rate of time preference will be r{s), for s G [0,T]. In the equation above 
we denote V^' = (g, • ■ • , g), and = for z, j = 1, . . .n. The 

optimal control will be a function u^{s). In our framework of changing pref- 
erences, this solution corresponds to the so-called pre-commitment solution, 
in the sense that it is optimal as long as the agent can precommit (by signing 
a contract, for example) his or her future behavior at time t = 0, and it will 
be denoted by = V^. If there is no commitment, the 0-agent will take 
the action m°(0) but, in the near future, the e-agent will change his decision 
rule (time-inconsistency) to the solution of 

r(s - e)V^ - V: = rn^^ + ■ / + ^tr (SSV^) | , x{e) = x, . 

Once again the optimal control trajectory u^{s), s G [e, T] will be changed for 
s > e by the following s-selves. In general, the solution for the naive agent 
will be constructed by solving the family of HJB equations 

r(s - t)V' - = max f^C + V^-f + ^tr (SSV^) | , x(t) = x, , 

for if: G [0,T], and patching together the "optimal" solutions M*(t). 

If the agent is sophisticated, things become more complicated. The stan- 
dard HJB equation cannot be used to construct the solution, and a new 
method is required. In what follows, we will derive a modified HJB equation 
which will help us to find the solution to Problem (^j^ and then 
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3.1 The discrete time case 



We will derive first the dynamic programming equation for a discretized 
version of Problem ([2113]), following a procedure similar to the one used in 
Karp (2007) and Marfn-Solano and Navas (2009). 

Let us divide the interval [0, T] into periods of constant length e, in 
such a way that we identify ds = e, and s = je, for j = 0, 1, . . . , A^. Denoting 
by x(je) = Xj, u{ke) = Uk {j, k = 0, . . . , N — 1), the objective of the agent in 
period t = je will be 



max Vj = E 



■N-j-l 

^ 9{ze)C{x,+„ u,+„ (z + j»e + 9{{N - j»F(x(T)) 

. i=0 



4+1 = xl + f{xk,Uk,ke)e + ^al{xk,Uk,ke){wi^^ - wi) 



(6) 
(7) 



1=1 



for i = 1, . . .n and k = j, . . . , N — 1, with Xj given. 

Let us state the dynamic programming algorithm for the discrete problem 
(ElED. In the final period, t = Ne = T,we define 1/^ = F{xn)- For j = 
the optimal value for ([6]) will be given by the solution to the problem 



^(7v-i) = ,inax^ E [C{xn-i, un-i, {N - l)e)e + 6iV^ 



{«iV-l} 



Ni ) 



N 



X 



-1 + rixN-i,UN-^i,iN - l)e)e + Ya=i^\{^n-\-,Un-\A^ 



N- 



with X 

l)e)(tt;jv ~ wjy_]^), for i = 1, . . .n. If u*j^_i{xn-i, {N — l)e) is the maximizer 
of the right hand term of the above equation, let us denote 

Hm-i{xn-i, {N - l)e) = C{xN-i,u%_,{xM,i, {N - l)e), {N - l)e) . (8) 

In general, for j = 1, . . . , — 1, the optimal value in ([6]) can be written as 



V* = max E 



N-j-l 



{Uj} 

Since 

V;*+i(xo-+i),(j+l)e) = i? 



C{xj, Uj,je)e + ^ OkHj+kixj+k, (j + k)e)e + On-jV* 



k=l 



(9) 



■N-j-2 



^ 9iHj+i^i{xj+i+i, {j + i + l)e)e + On-j-iV^ 

1=0 

(10) 

then, solving 9N-j-iV^{xi\i) in (fTOil and substituting in ([9]) we obtain: 
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Proposition 1 For every initial state Xq, the equilibrium value V* of prob- 
lem can be obtained as the solution of the following algorithm, which 
proceeds backward in time from period N — 1 to period 0: 

ON-j-iV*{xj,je) = raaxE[9N_j_iC{xj,Uj,je)e + 
mi 

N-j-l 

{dN-j-idk — dN-jdk-i) Hj^k{xj+k, {j + k)e)e+ (11) 

k=l 

+^iv-,V;.;i(x,+i, (j + l)e)] , V* = F(x(T)) , 

L 

4+1 = 4 + f\xj, Uj,je)e + J2 ^li^j^ - ^) . (12) 

(=1 

for i = 1, . . . ,n, j = 0, . . . , N — 1. Equations OTlfT^) are the equilibrium 
dynamic programming equations in discrete time, and their solution is the 
Markov Perfect Equilibrium (MPE) solution to problem (HQ). 

In the proof of the proposition we have implicitly assumed that the functions 
Vj and V* given by (El) and ( ITTi) . respectively, are well defined and finite. 

3.2 The continuous time case: a heuristic approach 

In order to solve Problem (l2H3l) for a sophisticated agent, first we need to 
define what we mean by a Markov equilibrium. Recall that the concept 
of optimality plays no role here, since what is optimal for the t- agent will 
not be optimal (in general) for the s-agents, s > t. A natural approach 
to the problem consists in considering first the equilibrium of a sequence of 
planners in discrete time (as we have done in the previous section) and then 
passing to the continuous time limit. Hence, the equilibrium value function 
to Problem ([21EI) is defined as the limit when e — > of the discrete stage 
equilibrium Problem (I6H3) , and the Markov Perfect Equilibrium is defined as 
the solution to the dynamic programming equation obtained as the (formal) 
limit when e — of equations ( |TTIfT2l) . This is probably the most intuitive 
approach, and was the one used (in a deterministic setting) in Karp (2007). 
By following these ideas, we will derive a "modified" HJB equation in a 
heuristic way. However, this approach is not rigorous, since the pass to the 
limit is "formal", and needs to be mathematically justified. For a given 
equilibrium rule u{x,s), a condition assuring the uniform convergence (in 
the mean square sense) of the solution to the discretized equation ([7]) to the 
true solution to (E]) is that /* and (t\ satisfy uniform growth and Lipschitz 
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conditions in x, and are Holder continuous of order 1/2 in the second variable. 
In the next section we will follow a different and rigorous approach, similar 
in spirit to the one first suggested in Barro (1999), proving a theorem for the 
modified HJB equation. 

According to the previous definition, let us assume that the equilibrium 
value function V^{xt,t) of the sophisticated t-agent, with initial condition 
x{t) = Xt, is of class in x, and of class in t (i.e., of class C^'^). Since 
t = je and x''{t + e) = x\t) + /*(a;(t), u(t), t)e + ^/l^ a,*(x(t), M(t), t)(u;'(t + 
e) — w'(t)), then V^{xt,t) = Vj{xj,je) and 



V^ixt+e,t + e) = V''^ixt,t) + 



{xt,u(t),t) 



n L 



+EE 



a 



=1 1=1 



' dx' 



{w\t + e) - w\t)) + o{e) 



Since 9k = exp J^'' r{s)ds^ , then Oj^_j = 6'jv_j_i [1 — r((A^ — j)e)e] + 
o(e) = 9n-j~i [1 - r(T - t)e] + o(e) and 9k^i = 9k[l + r{ke)e] + o(e), and 
substituting in ( |TT1) and simphfying we obtain 



= max E 

Mt)} 



(c + Vf + ■ / + (EEV/J - r(T - t) V^(x„ t) - K^e 



n L 



i=l 1=1 

where K{xt, t) is given by 



{xt,u(t),t)^ 



(13) 



K{xt,t) = E 



'n-j-l 



9{ke) [r{ke) - r{T - t)] Ht+ke{x{t + ke),t + ke)e 



k=l 



(14) 

Dividing equation (|T3l) by e, and taking the limit e — > (and hence n oo, 
since T = ne) in (fT3i) and (|T4l) . we obtain the modified HJB equation 



r(^T - t)V' + - ^ = max |£ + ■ / + -tr (SEV/,) j 



(15) 



where 



K{xt,t)=E 



T-t 



9{s) [r{s) - r{T - t)] H{x{t + s), t + s)ds 



E 



9{s-t)[r{s-t) ~r{T -t)]H{xs,s)ds 
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and H{xs, s) = C{xs,u*{xs, s), s). Finally, note that for the equilibrium rule 
u* = u*{xs,s), s G [t,T], by solving the stochastic differential equation ([2]) 
we can write function of Xt and s {xt+s = x{xt,s)). Therefore, 

H{xs,s) = H{xt,s) and the Markov Perfect Equilibrium (MPE) u*{x,t), 
t G [0,T], is obtained by solving ( ITSl) with 



Remark 1 Assume now that, in addition to the initial state x(0) = Xq, 
the final state x{T) = xt is given. In this case, the terminal condition 
W{T,x) = F{x) makes no sense (x{T) is fixed). Instead, we have the extra 
condition x{T) = xt in order to integrate the differential equations. 

Remark 2 As we have commented previously, the above derivation of the 
modified HJB equation is heuristic. The pass to the limit has to be math- 
ematically justified. In stochastic optimal control, in Fleming and Soner 
(2006) the convergence of finite difference approximations to HJB equations 
is discussed. However, the convergence is proved by using a method based 
on viscosity solution techniques, and therefore it is not applicable to equation 
173]) ■ which is not a partial differential equation due to the presence of a non- 
local term. In fact, for a numerical resolution of the problem in continuous 
time, the extension of the results on the convergence of numerical methods to 
the value function presented in Kushner and Dupuis (2001) is a topic which 
deserves attention, although such extension seems to be not straightforward. 

If there is no final function, equations (11511171) can be written as follows: 

Corollary 1 //, in Problem ^MBij, there is no final function (F{x{T)) = 0), 
then the modified HJB equation can be written as 




(16) 




■T 



e{s - t) [r{s - t) - r(T - t)] H{x, s)ds 



(17) 




where 




■T 




E 



and \/^(a;,T) = 0. 
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Proof: Along the equilibrium path, 



T 

S 



V''{x,t) = E 
Therefore 



9{s - t)H(x, s)ds + 9{T - t)F{x{T)) 



ut 



rT 

K{x,t) = E' ' 



e{s-t)r{s-t)H{x,s)ds-r{T -t) e{s - t)H{x, s)ds 



= K{x, t) - r{T - t)V^{x, t) - 9{T - t)r{T - t)E [F{x{T))] . 
If F = 0, the result follows by substituting the expression above in (fT5l) . □ 

If the discount rate is non-constant but the 0-agent can precommit at 
time t = his or her future behavior, then the corresponding (classical) HJB 
equation characterizing the pre-commitment solution becomes 



Vf = max |£ + Vf . / + ltr(SE'V;0| . 



When comparing equations (1181) and (iTSll . there are two differences. First, 
the term r{t)V{x,t) in (JTSll changes to r(T — t)V''^{x,t). Second, and more 
importantly, a new term K{x,t) appears in equation ffT^ . This new term 
involves the utility function C. Note that, in ffT7|) . H{x,s) is essentially the 
function C evaluated at the equilibrium rule. This is a substantial change 
with respect to the standard HJB equation. If the discount rate is constant, 
K = and we recover the usual HJB equation. Otherwise, this extra term 
has to be added, and equation (fT^ becomes an integro-differential equation. 
This fact determines a substantial increase in the complexity of the mathe- 
matical treatment. Moreover, the modified HJB equation given by fll5tll7p 
appears not to be very useful, insofar as it includes implicitly the equilibrium 
rule (that is, the solution to the problem) in the definition of K{x, t) (via 
H{x,s) = C{x*{x,u*{x, s), s),u*{x, s), s)). In this paper we illustrate how 
equations ( ITSlfTTl) can be used in order to solve a consumption and portfolio 
rules problem, by applying a guessing method for searching the solution. 

Another fundamental difference between stochastic optimal control the- 
ory and Problem (j2H3]) comes from the fact that, since the problem with 
non-constant discount rate of time preference is equivalent to a game with 
a continuous number of agents, each of whom wants to maximize the ex- 
pected present discounted value of current and future welfare, the notion 
of optimality is substituted by that of Markov Perfect Equilibrium. If such 
equilibria are non unique, the concept of Pareto optimality should be applied 
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here. Such non-uniqueness of candidate equihbria is usual in an infinite time 
setting, and was addressed in Karp (2007) (in a deterministic infinite horizon 
context), where a Pareto ranking of steady states was estabhshed. In the con- 
sumption and portfoho rules problem with a finite planning horizon studied 
in this paper the MPE is unique, and therefore this problem is avoided. 

Things are much easier in the (Mayer) problem, where we are just in- 
terested in maximizing a final expected utility (£ = in ([3])). If the 
discount rate is constant, it is clear that the optimal solution is indepen- 
dent of the discount rate. It is straightforward to see that this property 
is preserved in the case of a non-constant discount rate of time preference, 
not only for the pre-commitment and naive solutions (where actually we 
are solving standard optimal control problems), but also for sophisticated 
agents. For instance, let V be the solution when r = 0. Then V verifies 
—Vt = max{„} {Vx ■ f + \tr {TjTj'Vxx)^- Now, if is the value function for a 
sophisticated agent with (arbitrary) non-constant discounting, from (ITSi) we 
obtain {K = in this case) V^{x,t) = 6{T — t)V{x,t). Therefore, although 
the value function changes in a factor 6{T — t), the optimal/equilibrium 
control-state pair coincides for both problems. Hence, in a pure optimal 
portfolio management problem such as Problem (jMS]) where we omit con- 
sumption (m(c) = c = in the model), the introduction of time-inconsistent 
preferences does not add anything new. However, in problems where the final 
time T is a decision variable (T is not prefixed), the changing preferences of 
the decision-maker will modify the optimal solution, in general (the equilib- 
rium final time will be different for naive and sophisticated agents, and for 
different discount rates, see Marin-Solano and Navas (2009)). 

3.3 The Modified HJB Equation 

In this section we provide a mathematical justification of the dynamic pro- 
gramming equations fll5H17l) . We will follow the underlying idea used (in a 
deterministic setting) in Barro (1999) for the derivation of the equilibrium 
rules, consisting in assuming that the decision-maker at time t can precom- 
mit his future behavior during the period [t, t + e] . In Ekeland and Lazrak 
(2008) this idea was reformulated by considering that the t-agent is allowed 
to form a coahtion with his immediate successors (s-agents, with s G [t, t+e]), 
provided that, for s > t + e, the corresponding s-agents choose their equi- 
librium rule. Then, the equilibrium rule was calculated by taking the limit 
e ^ 0. It is remarkable that the equilibrium necessary conditions obtained 
(in a deterministic setting) in Karp (2007) and Ekeland and Lazrak (2008)) 



12 



are consistent, although the two approaches are different in nature. Very 
recently, in Ekeland and Pirvu (2008a, 2008b), the approach in Ekeland and 
Lazrak (2008) has been extended to a stochastic setting, and applied to some 
portfolio management problems, providing an alternative approach to ours. 
We refer the reader to these papers for a formal presentation and a mathe- 
matically rigorous analysis of the non-standard optimal control problem with 
non-constant discounting of time preference. 

Let us assume that the t-agent can precommit his future behavior during 
the period [t, t + e]. Then, given an equilibrium rule u{x, s), for s > t + e. 



V^{xt,t) = max E^^^t 
{"(*)} 



9{s - t)C{x{s), u{s), s)ds + 9{T - t)F{x{T)) 



max Ea,^^t 

{u(t)} 



t+e 



[s — t)C{x{s), u{s), s)ds+ 



(19) 



+ E, 



[s - t)H{x{s), s)ds + 9{T - t)F{x{T)) 



Ut+e 



where H{x{s), s) = C{x{s), u*{x{s), s),s), with u*{x, s) the equilibrium rule, 
is defined as in the previous section. Note that, due to the non-constant 
discounting, we cannot write the equation above in terms of V^(t + e, x(t + e)) 
in the usual form. Instead, 



max E, 



t+t,Xt + f_ 



V^(xt+„t + e) 



[s-t- e)H{x{s), s)ds + 9{T - t - e)F{x{T)) 



(20) 



t+e 



Let u be the maximum in ( fT9i) . Then, by solving Et+e,xt+^F{x{T)) in ([T9 
and fl20p and identifying terms we obtain 



9{T-t- e) 



/t+e 
9{s - t)C{x{s),u, s)ds- 



-Ext^^,t+e 



9{s -t)H{x{s),s)ds 



t+e 



9{T - 1) 



r^(x(t + e), t + e) - E^^^^^t+e [ 9{s - t)H{x{s), s)ds 

Jt+e 



If V^{x, t) is of class C'^'^, applying the Ito rule to V^{x{t + e), t + e), divid- 
ing by e and taking the limit e we recover the dynamic programming 
equations ( ITSlfTTj) . 

Next, we make rigorous the previous reasoning by proving a theorem, 
which is an extension of an standard result in stochastic optimal control 
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theory (see, e.g., Fleming and Rishel (1975) or Fleming and Soner (2006)). 
Let 

d ■ d " d"^ 

be the backward evolution operator, with aij = J2f=i ^i^i^ usual. 

Given initial data (t,x), we call n = {Q, {J-'s}, P, s{-),u{-)) an admissible 
control system if {Q,J-'t,P) is a probability space, {J-^s} is an increasing 
family of cr-algebras {t < s < T), and x{-), u{-) are stochastic processes on 
[if:, T] such that: 

1. x{s) G X for s G [t,T], x{t) = x, the sample paths x{-,u) are right 
continuous and have left hand limits, and x{s) is jF^-measurable; 

2. u{s) G f/ for s G [t,T], u{s) is jF^-measurable and u{-, ■) is measurable; 

3. For all function $(t,x) of class C^'^ with polynomial growth of $ and 

satisfying Et,^ I $(T,x(T)) I < oo, and E^,^ |A"W$(s, x(s))|rfs < 
oo, the Dynkin formula holds: 



Et,,^T, x{T)) - <l>(t, x) = Et,. 



s, x{s)) ds 



Then we have: 



Theorem 1 Let V^{t^x) he a function of class C^'"^ with polynomial growth 
of and A^V^ , solution to (W\T1\) . If there exists an admissible system 
TT* = {Q* , {J^* s} , X* {■) , u* {■)) such that u*{s) solves the right hand term in 
equation ( [73]) for LebesguexP*- almost all {s,uj) G [t,T] x Q* , then 



V^t, x) = Et,^ |y e{s- t)C{s, x{s),u*{s)) ds + e{T - t)F{x{T)) 

Proof: For an admissible control system vr* satisfying the conditions in the 
Theorem, since u*{s) G f/, from ffT^ we have 

-v4"*("V^(t, x(t)) = C{t, x{t), u*{t)) - r{T - t)V^{t, x{t)) - K{t, x{t)) . 



From the Dynkin formula and the above expression we obtain 



V^{t,x)=Et,. 



-A"*(^V^(s, x{s)) ds + F{x{T)) 



(21) 



E. 



t,x 



{C{s, x{s),u*{s)) - r{T - s)V^{s, x{s)) - K{s, x{s))) ds + F(x(T)) 
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where 



K{s, x{s)) ds 



Next, note that 



dsr{T-s) ^ dT9ir-s)Cir,xir),u*{T)) 

dsr{T-s)V%s,x)~ dse{T - s)r{T - s)F{x{T))^ 
= Et,.y^ dsr{T - s)V'{s,x) - F{x{T)) + 9{T -t)F{x{T))^ . 



By substituting in fl?T|) and simphfying we obtain 



V'{t,x)=Et,^ 



C{s, x{s), u*{s)) ds— 



(22) 



ds 



drOij — s)r{T — s)C{t,x{t),u*{t)) 



+ e{T-t)F{x{T)) \ . 



Finally, note that 



Et-x 



ds 



drOij — s)r{T — s)C{t,x{t),u*{t)) 



E, 



t,x 



dr C{t, x{t), u*{r)) 



ds 9(t — s)rfr — si 



E, 



t.x 



T 



dr (1 - 9{r - t))C{r, x(r), M*(r)) 



Then the result follows by substituting the expression above in ( l22i) . □ 
In the consumption and portfolio rules problem analyzed in this paper, 
the stochastic differential equations are linear (in fact, the equilibrium control 
rules are also linear in the state variable), and satisfy the hypothesis for the 
existence and uniqueness of solutions and for the Dynkin formula. For more 
details on admissible controls and conditions under which the Dynkin formula 
holds, see, e.g., Fleming and Soner (2006) or Fleming and Rishel (1975). 
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4 Optimal Portfolios for Time-Inconsistent 
Investors 



4.1 General Setting 

In this section, we analyze the consequences of introducing a non-constant 
discount rate in time preference into the classical solution by Merton (1969, 
1971) for the optimal consumption and portfolio problem. Let us briefly 
describe the basic parameters of the problem. 

The standard Ito processes model for a financial market consists of (m+1) 
securities. One of them is risk-free (a cash account, for instance), and the 
price Po{t) of 1 unit is assumed to evolve according to the ordinary differential 
equation = f^odt, where /iq > and -Po(O) = po > 0- There are also 

m risky assets (stocks, for instance), whose prices Pi{t), i = 1, . . .m, evolve 
according to a geometric Brownian motion stochastic process: 

— — = iiidt + aikdzk[t) , i = 1, . . . m , (23) 

'^^ ' k=i 

where -Pj(O) = pi > 0, {zi(t), . . . ,ZL{t)) is an L-dimensional standard Brow- 
nian motion process, and %(t) are mutually independent Brownian motions. 
For the sake of simplicity, we will assume that /xq and the drift vector of the 
risky assets /i = (/ii, . . . , /i^) are constant. 

From the diffusion matrix S = ((Xjfc), i = 1, . . . , m, k = 1, . . . , /, we can 
define the variance-covariance matrix S = S ■ S' = (o"tj), i,j = l,...m, 
whose coefficients are given by aij = Ylk=i ^ik(^jk- Note that S is symmetric 
((jjj = (jji). We will assume that S is positive definite. In particular, this 
implies that an > (all m risky assets are indeed risky) and S is nonsingular 
(det S > 0). Elements an are usually denoted by af, hence ctj = {auY^"^. 

By defining Zi{t) = ^ Ylk=i ^ik^kit), which are correlated standard Brow- 
nian motions with Cov {zi(t), Zj(t)) = -^^t, Equation (1^ becomes 

dPi = fiiPidt + aiPidzi , i = 1, . . .m , 

with dzidzj = pijdt for i,j = l,...m, where pij = Therefore, the 

problem for the t-agent consists in solving ([5]) subject to (jlj). 

In terms of the Wiener L-dimensional process with independent compo- 
nents {zi, . . . , zl), the budget equation is 



dW 



^ Wiifii - fio)W + {fioW - c) 



i=l 



ds + y^ Y^ wiaikWdzk . (24) 



1=1 k=l 
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Let us briefly recall the solution with a constant discount rate p. In this 
case the agent must solve the HJB equation 



pV — Vt = max <^ u{c) + 



^ Wj{pj - po)W + (pqW - c) 
.i=i 



Vw+ 



^ m m \ 

j=l k=l J 



(25) 



By solving the maximization problem in Wj, i 
standard optimal portfolio rule 

Vw 



, m, we obtain the 



w 



-S ^{p - po-1) 



(26) 



WVww 

where p = {pi, . . . , pm) and 1 = (1, . . . , 1). As for the optimal consumption, 
from the maximization problem in c in Equation (1251) we obtain 



u'(c) = Vw . 



(27) 



Next, let us assume that the discount rate r{t) of time preference is non- 
constant. For the general case, let us describe the so-called pre-commitment 
solution, and the solution for naive and sophisticated agents. 

Pre-commitment Solution: If the 0-agent can precommit his future be- 
havior, he must solve the corresponding HJB equation 



r{t)V^ - Vf = max { u{c 

{c,Wi} 



Wj{pj -po)W + {poW - c) 



:28] 



^ rn rn 

j=i k=i 

Since the right hand term in ( 128|) coincides with that in ( l25i) . then the optimal 
consumption and portfolio rules are given by (ITH) and fpUj) . respectively, with 
V replaced by V^. 

Solution for a Naive Agent: Naive t-agents will solve the problem by 
looking for the solution to the HJB equation 



r(r - t)V'^ - 1/^ = max <( n(c) 

{c,Wi} 



Wj{pj -po)W + {poW - c) 

.i=i 



j=l k=l J 



(29) 
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where the value function for the naive t-agent is V^{W,t), for r G [t,T]. 
From the maximization problem in ( l29l) we again obtain that the optimal 
consumption and portfolio rules are given by (^71) and (PUI) . with V replaced 
by V^^. In order to construct the actual trajectory, we will patch together 
the solutions 

Solution for a Sophisticated Agent: From Theorem 1, in order to solve 
Problem ([5]) subject to (l24l) . we analyze the modified HJB equation ( fTSll . 
which for our particular problem becomes 



r{T-t)V^+K-Vf = max <J u{c) + 

{c,Wi} 



ij=i 



^ m m I 

+ 2^Y. ^^^kCTjkW'V^^ , (30) 

j=l k=l J 

where K is given by f |T71) . Once again, the optimal consumption and portfolio 
rules are given by (EZD and (EED, with V replaced by 1/^. 

From Equation (l26l) it becomes clear that, if for every non-constant dis- 
count rate of time preference r(s), tttt^ — is constant, then the investment 
strategy will be independent of r(s) and it will be observationally equivalent 
to the constant discount rate case. From the solution to the problem in Mer- 
ton (1969, 1971), natural candidates for this observational equivalence are 
the CRRA (constant relative risk averse) utility functions, namely the loga- 
rithmic and potential functions. In the remaining subsections of the paper, 
we show how this observational equivalence exists in the investment strat- 
egy (not in the consumption rule) for the logarithmic and potential utility 
functions, but not for more general utility functions, such as the exponential 
CARA (constant absolute risk averse) utility function. 



4.2 Logarithmic Utility Function 

First of all, let us analyze the log-utility case, u{c) = Inc, with final function 
F{W{T)) = a\n{W{T)). 

In the case of a constant discount rate p, the agent must solve the HJB 
equation (l25l) . From a symmetry argument (see, for instance, Boyd (1990) 
and Chang (2004), pp. 193-194) it can be proved that V{W,t) = a{t) \nW + 
(3{t). In fact this symmetry argument can be applied to all the solutions in 
this section. From flTTj) we obtain c = (VV)~^ = W/a{t) and, by substituting 
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in (126|) the optimal portfolio rule becomes 

w = - fio ■ 1) . (31) 

By substituting in fl^ . the choice of the value function proves to be consis- 
tent and the optimal consumption rule is determined by 

pWt 



cit) 



1 - (1 -ap) e-P^T-t) 



(32) 



Next, for the general case of non-constant discounting, we solve and com- 
pare the solutions for pre-commitment, naive and sophisticated agents. 

Pre-commitment Solution: We must solve equation (!28|) . Once again, we 
know that the solution will be of the form V^{W,t) = a^(t) \nW + (3^{t). 
Then, the optimal consumption and portfolio rules are given by c = W/a^{t) 
and (1311) . respectively. By substituting in (l28l) . we obtain that a^{t),[3^{t) 
are the solution to the first order linear differential equation system 



(3P - r{t)(3'' + 



- r{t)a^ + 1 = 0, 
-(p - Po ■ l)'S"^(p - po ■ 1) + A^o 



(33) 



- Ina^ -1 = 



with (y.P{T) = a, P^(T) = 0. By solving a^it) we obtain 



ie{T) + e{s) ds 



(34) 



Solution for a Naive Agent: Naive t-agents will solve the problem by 
looking for the solution of the HJB equation ( l29i) . By guessing {W, r) = 
a^(r) In Vr + /3^(r), we obtain ^ and c(r) = W/a^{T) where a^(r) is the 
solution to the first order linear differential equation a'^ — r{T — t)a^ + 1 = 0, 
a^iT) = a, which is given by 



«^(r) 



(r-t) 



ad{T-t) + d{s-t)ds 



(35) 



Since the t-agent will not be time consistent for t > t, the actual consumption 
rule is obtained from the equation above for the case t = t, and therefore 



ie{T -t) + /; e{s - 1) ds 



(36) 



Solution for a Sophisticated Agent: A sophisticated agent will look 
for the solution of the modified HJB equation (IHUjl . with K given by ffTTj) . 
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From the maximization problem, the equihbrium consumption and portfoho 
rules are c^(t) = and ( 13T|) . Since the value function satisfies the same 

symmetry as in the previous problems, it will be necessarily of the form 
V^{W, t) = a^(t) \nW + P^(t). Let us verify the integro-differential equation 
(j30ll by the candidate solution. If the choice proves to be consistent, then 
c^it) = W/a^(t). The solution to the stochastic differential equation (IMl) is 
W{s) = Wtexp[At{s)], where 



AAs) 



m \ ^ f 

+ ^Wi(/Xi-/io)--^ ] 

i=l i=l \k 



(s-ty 



m L 



^ ^'^l i=l k=l 



Then, 



K = E 



^ Wis) ■ 

e{s - t) [r{s -t)- r{T - t)] In ds 



9{s - t) [r{s - t) - r(r - t)] [in Wt + At(s) - In a^(s)] ds 



E 



By substituting in flHOj) and simplifying we obtain 



\TlWf 



[s - t)[r{s -t)- r{T - t)] ds - a^{t) + r{T - t)a^{t) - 1 
9{s - t)[r{s -t)- r{T - t)] [At{s) - Ina^(s)] ds - r{T - t)(3%t) + 



Si 



m ^ m 



a^(t) - 1 . 



1=1 j,k=l 

Since the equation above must be satisfied for every Wt, then necessarily 



- r{T - t)a'' + 1 



s 



9{s-t) [r{s -t)-r{T- t)] ds . (37) 



Here, we can compare equation ( 1331) describing the precommitment solution, 
and equation (!H7|) for the sophisticated agent. If the discount rate is constant, 
then r{t) = r{T — t) = p and the integral term in fl37p vanishes. Otherwise, it 
contributes to the solution. Using that 6{s — t)r{s — t) ds = —6{s — t)\J = 
—9{T — t) + 1 we obtain 

aS _ r{T - t)a^ = -9{T -t)- r{T -t) j 9{s - t) ds . 
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The general solution to this first order linear differential equation, with the 
boundary condition a^(T) = a, is a ^{t) = ae{T -t) + /; e{s - 1) ds. Hence, 

c^it) = % , (38) 

ad{T -t) + e{s-t)ds 

which coincides with the solution obtained for a naive agent. Of course, this 
is a special feature of the logarithmic utility function, as we show next. 

4.3 Potential Utility Function 

Next, let us study the problem with a (isoelastic) potential utility function, 
u{c) = c^/-f, 7 < 1, 7 ^ 0, with final function F{W{T)) = a[W{T)]^/-f. 

As above, first we recall the solution with a constant discount rate p. 
From the right hand term in ( l25i) we obtain ( 1271) and ( l26l) . As a candidate to 
the value function we guess V(W, t) = a(t)[W(t)]'^ /■y. Once again, this choice 
is justified in Boyd (1990) from a symmetry argument, which is also applied 
to the pre-commitment, naive and sophisticated solutions. From fl27p we 
obtain c = {a{t))~~W and, from (l26ll . the optimal portfolio rule becomes 

w=-^E-^(/i-/io-l). (39) 
1-7 

By substituting in ( l25l) we get a Bernoulli equation, and solving it we obtain 

c{t) = ^ ^ . ^ , (40) 

a (p - 5P) + (1 - 7) (e—^^''^ - 

where 

5' = m + (/i - /^o ■ l)'S-i (/i - Po ■ 1) . (41) 

/ i — 7 

Next, we assume that the discount rate of time preference is non-constant. 



Pre-commitment Solution: We guess V^{W^t) = {t)[W (t)]"' / 'j 
Then, the equilibrium consumption and portfolio rules are given by c = 
{a^{t))^~W and (1391) . respectively. By substituting in (1281) we obtain that 
a^{t) is the solution to the BernouUi equation = (r(t) — 6^)0^ — {1 — 
'y){a^)~~ , Ci^{T) = a, where 5^ is given by fj^Tl) . By solving it we obtain 



c'^it) = —-^^ L__ _ . (42) 
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Solution for a Naive Agent: We must solve the HJB equation ( 129|) . By 
guessing V^(W,t) = d'^ {t)[W (r)]'^ / ■y and substituting in ( l29l) we obtain 
and c(r) = {a^ {t))~~^W , where Q;^(r) is the solution to = (r(r — 
t) — 5^)a^ — (1 — 7)(a^)~~, a^{T) = a, which is given by 



e{r-t) 



1 



The actual consumption rule, which is obtained for r = t, is determined by 
e"(*) = , ^' 7 . (43) 



t \e{T-t) 

Solution for a Sophisticated Agent: Let us look for the solution of 
the modified HJB equation flHUl) . with K given by ffT7|) . Once again, we 
guess V^iW^t) = (t)[W (t)]^ / 'J and verify the integro-difi^erential equation 
( l30l) for this solution. By substituting in ( l30l) we obtain ( l39ll and c(t) = 
{a^ {t))^~W , where a^(t) is the solution to the integro-differential equation 

= {r{T - t) - 6P)a^ - (1 - 7)(a^)"^+ (44) 

T 



where A(r) = ^uq + (a* ~ /^o ■ 1)' (yU — yUo ■ 1) — (a;"^(r))^~. In com- 
parison with the Bernoulli equation describing the precommitment solution, 
in (jH]) r(t) is replaced by r(T — t), and a new integral term appears, turning 
the Bernoulli equation into a very complicated highly non-linear integro- 
differential equation. 



4.4 Exponential Utility Function 

Finally, let us solve the problem for the constant absolute risk aversion utility 
function u{c) = —e~'^'^/'y, 7 > 0, with final function F{W{T)) = —ae~^^. 

In the constant discount rate case, we guess V{W, t) = —ae~"'^°'^^'^^^^^^^^ 
with a(T) = 0, p{T) = 1 (Boyd (1990), Chang (2004), pp. 193-194), 
and once again we can replicate the same symmetry argument for the pre- 
commitment, naive and sophisticated solutions. We proceed as before to 
obtain 

c = a{t)+mW-^^^^^^^ (45) 

7 
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and 

By substituting in ( !25l) . after several calculations we obtain 

a(t) = -le-^*^^(^)'^^ r [6'{s) - p] e^sP^-)dr^s ^ (43) 
7 it 

where 5^{t) = (3{t) - - /iq ■ - /xq ■ 1) - /3(t)ln (a7/3(t)). 

If the case of non-constant discounting we obtain the following solutions: 

Pre-commitment Solution: By guessing V^{W,t) = — ae"'^*^"^*^*^"'"'^^*^*-'^'', 
the associated equilibrium consumption and portfolio rules are given by (1451 - 
I46|) . with a{t),l3{t) replaced by (t) , (t) . Moreover, (3^{t) = (3{t), and 

T 



7 it 



Solution for a Naive Agent: We guess V^(W,t) = -ae-T^^'^^^^+^^M^) 
in equation (|29l) . As above, the consumption and portfolio rules coincide 
with those in ( H51H61) . with a(t),P{t) replaced by a'^ (t) , (t) . Once again, 
(3^{t) = Pit). Since a^(r) = -ie-Zr^/^Wds jT ^^e(^) _ _ g/^^ rfs, 
taking r = t we obtain 

7 Jt 

Solution for a Sophisticated Agent: In order to solve the modified HJB 
equation ([30]), with K given by (HZ!), we guess V^{W, t) = -ae-^("''W+^''W^). 
Let us verify the integro-differential equation ( l30l) for this solution. The 
consumption and portfolio rules are given by (I45ti46p . with a(t), P(t) replaced 
by a^(t),P^(t). By substituting in (IHUj) we obtain 



-ar(r-t)e-^("'W+'3'W^) + K(W^, t) -a7(«^(t) +/?^(t)iy)e-^(°'W+^'W^) = 
= -a/?^(t)e-^(-'W+^'W'^) + ^(^ - ^0 • l)'S-^(/x - ^0 ■ l)e-^(°"W+^'W^) + 



+ (^poW - a^(t) - P'^m + j a7/3^(t)e-^("'W+'^"W^) (51) 

where, from (fT7|) and (H5l) . 



K = E 



T 



e{s - t) [r{s -t)- r{T - t)] (^-ap^ {s)e-^^'''^'^+^'^'^^^'^^^ ds 



(52) 



23 



From the previous results, let us assume that P^(t) = P(t) (see ( H7j) ): i.e., we 
verify the integro-differential equation ([30]) for V^{W, t) = -ae-'^("''(*)+^(*)^). 
Let us calculate the contribution of the integral term K{W,t). Note that 
equation can be written as dW{s) = [(/io — P{s))W + B{s)] ds + C{s)dz, 
W{t) = Wt, where B{s) = ^^[^^-^^,. (/i - /io ■ 1) - a'{s) + ^^^^^ 

and C{s) 



W{s) 



— fio ■ ^S. The solution is 

= eI:i^^o-f3{r))dT Wt + 5(s)e-//(A'o-/^(^))^^ ds+ 

(3{t)Wt+ [ (3{s)B{s)ds+ [ (3{s)C{s)dz 



where we have used that e-^« l3{r))dT _ Pify^ Then fl52D becomes 



K{Wt,t) = -ae 



E 



e{s - t)[r{s - t) - r{T - t)](3{s)- 



and (|5T1) can be rewritten as 

-r(T-t) + A(«^, t) -7«^(t) +/3(t) - ^ (/i-^o ■ l)'S-i(^-/io ■ 1) +7«^(^)/5(^)- 

-/3(t)ln (a7/3(t)) - 7 (/j(t) + /io/3(t) - /^'(t)) = 

Since ( 1471) is the solution to the differential equation I3{t) + fioP{t) — P^(t) = 
with the boundary condition /3(T) = 1, the assumption P^{t) = P(t) 
is consistent with the integro-differential equation (1511) . Note that, in this 
case, the integral term K{W, t) does not contribute to the calculation of 
P^it). With respect to a^{t), it is the solution of a very complicated integro- 
differential equation. 



4.5 Comparison between the different solutions 

First of all, we summarize the results obtained for the log- utility in Table 1. 

A relevant property of the logarithmic utility function is that the portfolio 
rule is independent of the discount factor, and it is the same for the pre- 
commitment, naive and sophisticated solutions. Concerning the consumption 
rule, it coincides for naive and sophisticated agents. This is a remarkable 
property, in the sense that naive and sophisticated behaviors are completely 
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Consumption rule 


Portfolio rule 


Ptj.\ 0{t)Wt 
^ ^ ^ ~ a0{T)+f^^ 0{s)ds 


W"^ V! ^(ll //n • 




N(4. \ Wt 

^ > ~ ae{T-t)+f^ e{s-t)ds 


= S-^(/i - /io ■ 


1) 


^S(^\ _ Wt 

^ ' ae{T-t)+f^ 9{s-t)ds 




1) 



Table 1: Logarithmic utility function. 



different in nature. However, this result is not surprising, since it coincides 
with that obtained by PoUak (1968) for the Strotz's model. 

With respect to the pre-commitment solution, it will be different in gen- 
eral to the naive and sophisticated solutions, unless r{t) is constant. Let us 
denote by A^, and A"^ the propensity to consume for the pre-commitment, 
naive and sophisticated solutions, respectively (c(t) = \(t)Wt). Since r(s) is 
nonincreasing, then r^r) dr > j^^^r{T)dT = J^^^ r (t) dr — J^^ r (t) dr . 
Therefore, e{s)e{t) < e{s + t). In particular, e{T - t)e{t) < 9{T) and 
e{s - t)e{t) < e{s), so X^{t) < X^{t) = X^{t), i.e., naive (and sophisticated) 
agents overconsume compared with the pre-commitment solution. 

Unlike the Ramsey model (see Barro (1999)), for the logarithmic util- 
ity function the naive (or sophisticated) solution will be in general non- 
observationally equivalent to the standard solution with some constant dis- 
count rate p. Note that, for such observational equivalence, it is neces- 
sary and sufficient that a(t) = a^it), for some p, i.e., from ( l32l) and ( l38ll . 
(a - e-^(^"*) + ^^=ae{T~t) + J^e{s-t)ds. If a = 0, then by differen- 
tiating with respect to t we obtain that 9{T — t)= e~''(^~*) and the discount 
factor is constant. For a 7^ 0, if 6(T — t) ^ e'~^^'^~''^\ by solving a we obtain 

By defining x{t) = ^ (l — e^^*^-^^*^) — 0{s — t) ds, the above equation be- 
comes a = Ipj, therefore x{t) = Ae*/". By identifying x{t) with the denomi- 
nator in the right hand term in ( l53l) we obtain 9(T — t) = ^e^^'^ + e"^*-^"*-*. 
Hence, there will be observational equivalence with a standard model with 
constant discount rate p if, and only if, a 7^ and 6{t) is a linear combination 
of two exponentials with constant discount rates p and 1/a. 

We recover observational equivalence in the limit T 00. In this case, 
the propensity to consume in the case of a constant discount rate p is A = p. 
Under non-constant discounting, if we assume that limt^oo r{t) = f > 0, the 
propensity to consume of naive and sophisticated agents becomes constant. 
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= \^ = 1/ 6{T)dT. Then, in the hmit T — * oo, we obtain observational 
equivalence with effective discount rate p = [J^ 9{T)dT]~^ . 

Table 2 presents the results obtained for a potential utility function. 



Consumption rule 


Portfolio rule 








= j^^-\fi- Ho 


1) 










1) 




cHt) = 


[a^Y~ Wu a^{t) given by ^ 




= Y^S-i(/i-/io 


1) 



Table 2: Potential utility function. 



Again, the portfolio rule is independent of the discount factor, and coin- 
cides for the three solutions. However, the coincidence of the consumption 
rule of naive and sophisticated agents in the logarithmic case is not preserved. 
A possible interpretation for this result can be found in Marin-Solano and 
Navas (2008) (see also Barro (1999)). Since the integral in equation (jHl) dis- 
appears in the logarithmic case (which is the limit when 7 — of the utility 
function u{c) = {c^ — l)/7), the sophisticated t-agent decides his optimal 
consumption at time t without being affected by his future selves and, in 
some way, behaving as an homogeneous decision-maker, similarly to a naive 
agent. 

In the limit T — > 00, if and r{t) > 6^, the propensities to consume A"^, 
are 

m 



where 9{s) - 
A^ 



A^ 

e~fo ~^{-r)dr ^ with f(r) = r(r) 
1 



(54) 



1-7 



ds 
6^, and 



t) 1-t dr 



l-7-r[r(r)-f] 9ir) 



(55) 



1-7 



dr 



For t = 0, the propensities to consume given by the naive and the precommit- 
ment solutions coincide, but (A^)'(O) < ((A^)'(O) = 0), as expected. In gen- 
eral, since /o~^(r)^ dr = [1 -7 + /o°°(r(0) -r(r))(^(r))T^ rfr]/(r(0) - 5^), 
from ([55]) we get (f - 5p)/{1 - 7) < A^ < (r(0) - 5p)/{1 - 7). With respect 
to A^, 

\S ^ r - 6P 

1 _ ^ _ J- [r{r) - f] e{T)e^^- dr ^ ' 
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and therefore A'^ > (f — — 7). Moreover, since the propensity to 

consume under an instantaneous discount rate p is A = (p — — 7), 

we obtain observational equivalence for naive and sophisticated agents (with 
different effective discount rates p = (1 — 7)A^ + 5^ and p = (1 — 7)A'^ + 5^, 
respectively). 

Finally, note that the coincidence of the portfolio rule for the different 
behaviors of the decision maker in the case of CRRA utility functions is no 
longer satisfied for more general HARA utility functions, such as the expo- 
nential (CARA) utility function. Although in the exponential case the ex- 
pression of the portfolio rule (as a function of {W, t)) is the same for an agent 
with constant discount rate and the different solutions with non-constant 
discounting (they are all given by (Uni), with /3{t) = /3^{t) = /3^{t) = /?^(t)), 
this property does not imply that the portfolio rule is independent of the dis- 
count factor, since the evolution of W(t) depends on the values of a(t), a^(t), 
a'^{t) and a^(t), respectively, and all these functions do not coincide, in gen- 
eral. The consumption rule is also different. It is easy to check that, at time 
t = 0, a^(0) = a^(0) but (a^)'(O) < (a^)'(O), hence (c^)'(O) > (c^)'(O). 

5 Concluding remarks 

In this paper we study the problem of searching for optimal/equilibrium rules 
in the case where decision-makers have time-inconsistent preferences, within 
a stochastic framework. For the so-called sophisticated agents, we derive a 
modified HJB equation which extends the equation for a deterministic prob- 
lem (see Karp (2007) and Marin-Solano and Navas (2008)). This equation 
is obtained, first, heuristically, and it is mathematically justified later on, by 
following a different approach. Although this modified HJB equation seems 
to be too complicated in general, we illustrate with several examples how 
it can be managed in order to obtain information about the solution. In 
particular, this modified HJB equation is used in order to solve (for some 
utility functions) the classical consumption and portfolio rules model when 
the instantaneous discount rate of time preference is non-constant. 

A relevant result is that for the CRRA (logarithmic and potential) utility 
functions, the portfolio rule coincides for the pre-commitment, naive and 
sophisticated solutions. Moreover, it is independent of the discount factor, 
and thus coincides with the standard solution when the discount factor is an 
exponential with constant discount rate. This property is no longer satisfied 
for more general utility functions, such as the CARA (exponential) function. 
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With respect to the consumption rule, it is proved that, in the log-utihty 
case, it coincides for naive and sophisticated agents. This is a remarkable 
property which extends to the Merton's model a similar result already an- 
nounced in Pollak (1968) for the Strotz model. This coincidence is no longer 
satisfied for more general utility functions. In the log-utility case, the ob- 
servational equivalence problem first studied for the Ramsey model in Barro 
(1999) is analyzed, with a negative answer except for a very particular form 
of the discount factor. 

Possible extensions of the results in the paper include several financial 
and actuarial applications, such as contribution and portfolio selection in 
pension funding (see, e.g., Josa-Fombellida and Rincon-Zapatero (2008) and 
references therein). 
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